Overview

Dataset statistics

Number of variables21
Number of observations29165
Missing cells9027
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.4 MiB
Average record size in memory590.8 B

Variable types

Numeric8
Categorical11
Boolean2

Alerts

Has a mobile phone has constant value "1"Constant
Account age_x is highly overall correlated with Account age_yHigh correlation
Account age_y is highly overall correlated with Account age_xHigh correlation
Children count is highly overall correlated with Family member countHigh correlation
Employment length is highly overall correlated with Employment status and 1 other fieldsHigh correlation
Employment status is highly overall correlated with Employment lengthHigh correlation
Family member count is highly overall correlated with Children countHigh correlation
Gender is highly overall correlated with Job titleHigh correlation
Job title is highly overall correlated with Employment length and 1 other fieldsHigh correlation
Education level is highly imbalanced (50.6%)Imbalance
Dwelling is highly imbalanced (73.3%)Imbalance
Has an email is highly imbalanced (56.3%)Imbalance
Is high risk is highly imbalanced (87.5%)Imbalance
Job title has 9027 (31.0%) missing valuesMissing
ID has unique valuesUnique
Children count has 20143 (69.1%) zerosZeros

Reproduction

Analysis started2026-01-05 18:40:36.854764
Analysis finished2026-01-05 18:40:49.974365
Duration13.12 seconds
Software versionydata-profiling vv4.18.0
Download configurationconfig.json

Variables

ID
Real number (ℝ)

Unique 

Distinct29165
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5078231.6
Minimum5008804
Maximum5150485
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size228.0 KiB
2026-01-05T12:40:50.117520image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum5008804
5-th percentile5018455.4
Q15042047
median5074666
Q35114629
95-th percentile5146012.8
Maximum5150485
Range141681
Interquartile range (IQR)72582

Descriptive statistics

Standard deviation41824.001
Coefficient of variation (CV)0.008235938
Kurtosis-1.2095593
Mean5078231.6
Median Absolute Deviation (MAD)38051
Skewness0.084511077
Sum1.4810662 × 1011
Variance1.749247 × 109
MonotonicityNot monotonic
2026-01-05T12:40:50.249584image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50370481
 
< 0.1%
50446301
 
< 0.1%
50790791
 
< 0.1%
51128721
 
< 0.1%
51058581
 
< 0.1%
51004111
 
< 0.1%
50228171
 
< 0.1%
50098111
 
< 0.1%
51139221
 
< 0.1%
50215411
 
< 0.1%
Other values (29155)29155
> 99.9%
ValueCountFrequency (%)
50088041
< 0.1%
50088051
< 0.1%
50088061
< 0.1%
50088081
< 0.1%
50088101
< 0.1%
50088131
< 0.1%
50088141
< 0.1%
50088151
< 0.1%
50088191
< 0.1%
50088211
< 0.1%
ValueCountFrequency (%)
51504851
< 0.1%
51504821
< 0.1%
51504811
< 0.1%
51504801
< 0.1%
51504781
< 0.1%
51504771
< 0.1%
51504681
< 0.1%
51504651
< 0.1%
51504641
< 0.1%
51504631
< 0.1%

Gender
Categorical

High correlation 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
F
19549 
M
9616 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters29165
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowF
3rd rowF
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
F19549
67.0%
M9616
33.0%

Length

2026-01-05T12:40:50.398175image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:50.466549image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
f19549
67.0%
m9616
33.0%

Most occurring characters

ValueCountFrequency (%)
F19549
67.0%
M9616
33.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
F19549
67.0%
M9616
33.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
F19549
67.0%
M9616
33.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
F19549
67.0%
M9616
33.0%

Has a car
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size28.6 KiB
False
18128 
True
11037 
ValueCountFrequency (%)
False18128
62.2%
True11037
37.8%
2026-01-05T12:40:50.511738image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size28.6 KiB
True
19557 
False
9608 
ValueCountFrequency (%)
True19557
67.1%
False9608
32.9%
2026-01-05T12:40:50.562809image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Children count
Real number (ℝ)

High correlation  Zeros 

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.43079033
Minimum0
Maximum19
Zeros20143
Zeros (%)69.1%
Negative0
Negative (%)0.0%
Memory size228.0 KiB
2026-01-05T12:40:50.627901image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum19
Range19
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.74188219
Coefficient of variation (CV)1.7221422
Kurtosis23.798772
Mean0.43079033
Median Absolute Deviation (MAD)0
Skewness2.5929624
Sum12564
Variance0.55038919
MonotonicityNot monotonic
2026-01-05T12:40:50.780280image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
020143
69.1%
16003
 
20.6%
22624
 
9.0%
3323
 
1.1%
452
 
0.2%
515
 
0.1%
72
 
< 0.1%
142
 
< 0.1%
191
 
< 0.1%
ValueCountFrequency (%)
020143
69.1%
16003
 
20.6%
22624
 
9.0%
3323
 
1.1%
452
 
0.2%
515
 
0.1%
72
 
< 0.1%
142
 
< 0.1%
191
 
< 0.1%
ValueCountFrequency (%)
191
 
< 0.1%
142
 
< 0.1%
72
 
< 0.1%
515
 
0.1%
452
 
0.2%
3323
 
1.1%
22624
 
9.0%
16003
 
20.6%
020143
69.1%

Income
Real number (ℝ)

Distinct259
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean186890.39
Minimum27000
Maximum1575000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size228.0 KiB
2026-01-05T12:40:50.897283image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum27000
5-th percentile76500
Q1121500
median157500
Q3225000
95-th percentile360000
Maximum1575000
Range1548000
Interquartile range (IQR)103500

Descriptive statistics

Standard deviation101409.64
Coefficient of variation (CV)0.54261563
Kurtosis18.289145
Mean186890.39
Median Absolute Deviation (MAD)45000
Skewness2.7571154
Sum5.4506581 × 109
Variance1.0283916 × 1010
MonotonicityNot monotonic
2026-01-05T12:40:51.018253image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1350003468
 
11.9%
1800002487
 
8.5%
1575002469
 
8.5%
2250002373
 
8.1%
1125002359
 
8.1%
2025001781
 
6.1%
900001395
 
4.8%
2700001344
 
4.6%
315000795
 
2.7%
247500686
 
2.4%
Other values (249)10008
34.3%
ValueCountFrequency (%)
270001
 
< 0.1%
292503
 
< 0.1%
301503
 
< 0.1%
3150015
0.1%
31531.53
 
< 0.1%
324003
 
< 0.1%
333009
< 0.1%
337501
 
< 0.1%
360003
 
< 0.1%
369006
 
< 0.1%
ValueCountFrequency (%)
15750007
 
< 0.1%
13500005
 
< 0.1%
11250003
 
< 0.1%
9900003
 
< 0.1%
9450003
 
< 0.1%
90000028
0.1%
81000013
< 0.1%
7875001
 
< 0.1%
7650005
 
< 0.1%
7425004
 
< 0.1%

Employment status
Categorical

High correlation 

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.7 MiB
Working
15056 
Commercial associate
6801 
Pensioner
4920 
State servant
2381 
Student
 
7

Length

Max length20
Median length7
Mean length10.8587
Min length7

Characters and Unicode

Total characters316694
Distinct characters21
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowWorking
2nd rowCommercial associate
3rd rowCommercial associate
4th rowCommercial associate
5th rowWorking

Common Values

ValueCountFrequency (%)
Working15056
51.6%
Commercial associate6801
23.3%
Pensioner4920
 
16.9%
State servant2381
 
8.2%
Student7
 
< 0.1%

Length

2026-01-05T12:40:51.178244image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:51.296448image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
working15056
39.3%
commercial6801
17.7%
associate6801
17.7%
pensioner4920
 
12.8%
state2381
 
6.2%
servant2381
 
6.2%
student7
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o33578
10.6%
i33578
10.6%
r29158
 
9.2%
e28211
 
8.9%
n27284
 
8.6%
a25165
 
7.9%
s20903
 
6.6%
k15056
 
4.8%
W15056
 
4.8%
g15056
 
4.8%
Other values (11)73649
23.3%

Most occurring categories

ValueCountFrequency (%)
(unknown)316694
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o33578
10.6%
i33578
10.6%
r29158
 
9.2%
e28211
 
8.9%
n27284
 
8.6%
a25165
 
7.9%
s20903
 
6.6%
k15056
 
4.8%
W15056
 
4.8%
g15056
 
4.8%
Other values (11)73649
23.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown)316694
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o33578
10.6%
i33578
10.6%
r29158
 
9.2%
e28211
 
8.9%
n27284
 
8.6%
a25165
 
7.9%
s20903
 
6.6%
k15056
 
4.8%
W15056
 
4.8%
g15056
 
4.8%
Other values (11)73649
23.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown)316694
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o33578
10.6%
i33578
10.6%
r29158
 
9.2%
e28211
 
8.9%
n27284
 
8.6%
a25165
 
7.9%
s20903
 
6.6%
k15056
 
4.8%
W15056
 
4.8%
g15056
 
4.8%
Other values (11)73649
23.3%

Education level
Categorical

Imbalance 

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.1 MiB
Secondary / secondary special
19803 
Higher education
7910 
Incomplete higher
 
1129
Lower secondary
 
298
Academic degree
 
25

Length

Max length29
Median length29
Mean length24.85462
Min length15

Characters and Unicode

Total characters724885
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSecondary / secondary special
2nd rowHigher education
3rd rowSecondary / secondary special
4th rowHigher education
5th rowSecondary / secondary special

Common Values

ValueCountFrequency (%)
Secondary / secondary special19803
67.9%
Higher education7910
 
27.1%
Incomplete higher1129
 
3.9%
Lower secondary298
 
1.0%
Academic degree25
 
0.1%

Length

2026-01-05T12:40:51.441389image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:51.525728image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
secondary39904
40.7%
19803
20.2%
special19803
20.2%
higher9039
 
9.2%
education7910
 
8.1%
incomplete1129
 
1.2%
lower298
 
0.3%
academic25
 
< 0.1%
degree25
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e79312
10.9%
c68796
9.5%
68771
9.5%
a67642
9.3%
r49266
 
6.8%
o49241
 
6.8%
n48943
 
6.8%
d47864
 
6.6%
y39904
 
5.5%
s39904
 
5.5%
Other values (15)165242
22.8%

Most occurring categories

ValueCountFrequency (%)
(unknown)724885
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e79312
10.9%
c68796
9.5%
68771
9.5%
a67642
9.3%
r49266
 
6.8%
o49241
 
6.8%
n48943
 
6.8%
d47864
 
6.6%
y39904
 
5.5%
s39904
 
5.5%
Other values (15)165242
22.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown)724885
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e79312
10.9%
c68796
9.5%
68771
9.5%
a67642
9.3%
r49266
 
6.8%
o49241
 
6.8%
n48943
 
6.8%
d47864
 
6.6%
y39904
 
5.5%
s39904
 
5.5%
Other values (15)165242
22.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown)724885
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e79312
10.9%
c68796
9.5%
68771
9.5%
a67642
9.3%
r49266
 
6.8%
o49241
 
6.8%
n48943
 
6.8%
d47864
 
6.6%
y39904
 
5.5%
s39904
 
5.5%
Other values (15)165242
22.8%

Marital status
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.6 MiB
Married
20044 
Single / not married
3864 
Civil marriage
2312 
Separated
 
1712
Widow
 
1233

Length

Max length20
Median length7
Mean length9.3100977
Min length5

Characters and Unicode

Total characters271529
Distinct characters20
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMarried
2nd rowSingle / not married
3rd rowMarried
4th rowSingle / not married
5th rowSeparated

Common Values

ValueCountFrequency (%)
Married20044
68.7%
Single / not married3864
 
13.2%
Civil marriage2312
 
7.9%
Separated1712
 
5.9%
Widow1233
 
4.2%

Length

2026-01-05T12:40:51.642823image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:51.735278image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
married23908
55.5%
single3864
 
9.0%
3864
 
9.0%
not3864
 
9.0%
civil2312
 
5.4%
marriage2312
 
5.4%
separated1712
 
4.0%
widow1233
 
2.9%

Most occurring characters

ValueCountFrequency (%)
r54152
19.9%
i35941
13.2%
e33508
12.3%
a31956
11.8%
d26853
9.9%
M20044
 
7.4%
13904
 
5.1%
n7728
 
2.8%
l6176
 
2.3%
g6176
 
2.3%
Other values (10)35091
12.9%

Most occurring categories

ValueCountFrequency (%)
(unknown)271529
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
r54152
19.9%
i35941
13.2%
e33508
12.3%
a31956
11.8%
d26853
9.9%
M20044
 
7.4%
13904
 
5.1%
n7728
 
2.8%
l6176
 
2.3%
g6176
 
2.3%
Other values (10)35091
12.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown)271529
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
r54152
19.9%
i35941
13.2%
e33508
12.3%
a31956
11.8%
d26853
9.9%
M20044
 
7.4%
13904
 
5.1%
n7728
 
2.8%
l6176
 
2.3%
g6176
 
2.3%
Other values (10)35091
12.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown)271529
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
r54152
19.9%
i35941
13.2%
e33508
12.3%
a31956
11.8%
d26853
9.9%
M20044
 
7.4%
13904
 
5.1%
n7728
 
2.8%
l6176
 
2.3%
g6176
 
2.3%
Other values (10)35091
12.9%

Dwelling
Categorical

Imbalance 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.8 MiB
House / apartment
26059 
With parents
 
1406
Municipal apartment
 
912
Rented apartment
 
453
Office apartment
 
208

Length

Max length19
Median length17
Mean length16.790125
Min length12

Characters and Unicode

Total characters489684
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowWith parents
2nd rowHouse / apartment
3rd rowHouse / apartment
4th rowHouse / apartment
5th rowHouse / apartment

Common Values

ValueCountFrequency (%)
House / apartment26059
89.4%
With parents1406
 
4.8%
Municipal apartment912
 
3.1%
Rented apartment453
 
1.6%
Office apartment208
 
0.7%
Co-op apartment127
 
0.4%

Length

2026-01-05T12:40:51.864240image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:51.965901image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
apartment27759
32.9%
house26059
30.9%
26059
30.9%
with1406
 
1.7%
parents1406
 
1.7%
municipal912
 
1.1%
rented453
 
0.5%
office208
 
0.2%
co-op127
 
0.2%

Most occurring characters

ValueCountFrequency (%)
t58783
12.0%
a57836
11.8%
e56338
11.5%
55224
11.3%
n30530
 
6.2%
p30204
 
6.2%
r29165
 
6.0%
m27759
 
5.7%
s27465
 
5.6%
u26971
 
5.5%
Other values (15)89409
18.3%

Most occurring categories

ValueCountFrequency (%)
(unknown)489684
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t58783
12.0%
a57836
11.8%
e56338
11.5%
55224
11.3%
n30530
 
6.2%
p30204
 
6.2%
r29165
 
6.0%
m27759
 
5.7%
s27465
 
5.6%
u26971
 
5.5%
Other values (15)89409
18.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown)489684
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t58783
12.0%
a57836
11.8%
e56338
11.5%
55224
11.3%
n30530
 
6.2%
p30204
 
6.2%
r29165
 
6.0%
m27759
 
5.7%
s27465
 
5.6%
u26971
 
5.5%
Other values (15)89409
18.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown)489684
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t58783
12.0%
a57836
11.8%
e56338
11.5%
55224
11.3%
n30530
 
6.2%
p30204
 
6.2%
r29165
 
6.0%
m27759
 
5.7%
s27465
 
5.6%
u26971
 
5.5%
Other values (15)89409
18.3%

Age
Real number (ℝ)

Distinct6794
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-15979.477
Minimum-25152
Maximum-7705
Zeros0
Zeros (%)0.0%
Negative29165
Negative (%)100.0%
Memory size228.0 KiB
2026-01-05T12:40:52.093968image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum-25152
5-th percentile-23021
Q1-19444
median-15565
Q3-12475
95-th percentile-9873
Maximum-7705
Range17447
Interquartile range (IQR)6969

Descriptive statistics

Standard deviation4202.9975
Coefficient of variation (CV)-0.26302471
Kurtosis-1.0433005
Mean-15979.477
Median Absolute Deviation (MAD)3424
Skewness-0.18225185
Sum-4.6604146 × 108
Variance17665188
MonotonicityNot monotonic
2026-01-05T12:40:52.266577image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-1551944
 
0.2%
-1267644
 
0.2%
-1689633
 
0.1%
-1676826
 
0.1%
-1605326
 
0.1%
-1440025
 
0.1%
-1412224
 
0.1%
-1466724
 
0.1%
-1112624
 
0.1%
-2286724
 
0.1%
Other values (6784)28871
99.0%
ValueCountFrequency (%)
-251521
 
< 0.1%
-251403
< 0.1%
-250991
 
< 0.1%
-250881
 
< 0.1%
-250102
< 0.1%
-249631
 
< 0.1%
-249463
< 0.1%
-249324
< 0.1%
-249143
< 0.1%
-248781
 
< 0.1%
ValueCountFrequency (%)
-77051
 
< 0.1%
-77231
 
< 0.1%
-77573
< 0.1%
-79592
< 0.1%
-79801
 
< 0.1%
-80414
< 0.1%
-80541
 
< 0.1%
-80562
< 0.1%
-80691
 
< 0.1%
-80762
< 0.1%

Employment length
Real number (ℝ)

High correlation 

Distinct3483
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59257.761
Minimum-15713
Maximum365243
Zeros0
Zeros (%)0.0%
Negative24257
Negative (%)83.2%
Memory size228.0 KiB
2026-01-05T12:40:52.448996image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum-15713
5-th percentile-7264
Q1-3153
median-1557
Q3-412
95-th percentile365243
Maximum365243
Range380956
Interquartile range (IQR)2741

Descriptive statistics

Standard deviation137655.88
Coefficient of variation (CV)2.3230018
Kurtosis1.1433571
Mean59257.761
Median Absolute Deviation (MAD)1309
Skewness1.7724256
Sum1.7282526 × 109
Variance1.8949142 × 1010
MonotonicityNot monotonic
2026-01-05T12:40:52.613509image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3652434908
 
16.8%
-40161
 
0.2%
-20055
 
0.2%
-208753
 
0.2%
-153951
 
0.2%
-167847
 
0.2%
-108147
 
0.2%
-253146
 
0.2%
-116045
 
0.2%
-30944
 
0.2%
Other values (3473)23808
81.6%
ValueCountFrequency (%)
-157131
 
< 0.1%
-156613
 
< 0.1%
-152271
 
< 0.1%
-150722
 
< 0.1%
-1503813
< 0.1%
-148876
< 0.1%
-148106
< 0.1%
-147752
 
< 0.1%
-145364
 
< 0.1%
-144735
 
< 0.1%
ValueCountFrequency (%)
3652434908
16.8%
-172
 
< 0.1%
-651
 
< 0.1%
-661
 
< 0.1%
-702
 
< 0.1%
-711
 
< 0.1%
-7314
 
< 0.1%
-781
 
< 0.1%
-791
 
< 0.1%
-881
 
< 0.1%

Has a mobile phone
Categorical

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
1
29165 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters29165
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
129165
100.0%

Length

2026-01-05T12:40:52.738940image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:52.817185image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
129165
100.0%

Most occurring characters

ValueCountFrequency (%)
129165
100.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
129165
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
129165
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
129165
100.0%

Has a work phone
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
22623 
1
6542 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters29165
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
022623
77.6%
16542
 
22.4%

Length

2026-01-05T12:40:52.902966image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:52.987154image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
022623
77.6%
16542
 
22.4%

Most occurring characters

ValueCountFrequency (%)
022623
77.6%
16542
 
22.4%

Most occurring categories

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
022623
77.6%
16542
 
22.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
022623
77.6%
16542
 
22.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
022623
77.6%
16542
 
22.4%

Has a phone
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
20562 
1
8603 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters29165
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
020562
70.5%
18603
29.5%

Length

2026-01-05T12:40:53.092883image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:53.190604image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
020562
70.5%
18603
29.5%

Most occurring characters

ValueCountFrequency (%)
020562
70.5%
18603
29.5%

Most occurring categories

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
020562
70.5%
18603
29.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
020562
70.5%
18603
29.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
020562
70.5%
18603
29.5%

Has an email
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
26532 
1
 
2633

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters29165
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
026532
91.0%
12633
 
9.0%

Length

2026-01-05T12:40:53.304563image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:53.378820image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
026532
91.0%
12633
 
9.0%

Most occurring characters

ValueCountFrequency (%)
026532
91.0%
12633
 
9.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
026532
91.0%
12633
 
9.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
026532
91.0%
12633
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
026532
91.0%
12633
 
9.0%

Job title
Categorical

High correlation  Missing 

Distinct18
Distinct (%)0.1%
Missing9027
Missing (%)31.0%
Memory size1.6 MiB
Laborers
5004 
Core staff
2866 
Sales staff
2773 
Managers
2422 
Drivers
1722 
Other values (13)
5351 

Length

Max length21
Median length20
Mean length10.533916
Min length7

Characters and Unicode

Total characters212132
Distinct characters36
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCore staff
2nd rowAccountants
3rd rowLaborers
4th rowManagers
5th rowAccountants

Common Values

ValueCountFrequency (%)
Laborers5004
17.2%
Core staff2866
 
9.8%
Sales staff2773
 
9.5%
Managers2422
 
8.3%
Drivers1722
 
5.9%
High skill tech staff1133
 
3.9%
Accountants998
 
3.4%
Medicine staff956
 
3.3%
Cooking staff521
 
1.8%
Security staff464
 
1.6%
Other values (8)1279
 
4.4%
(Missing)9027
31.0%

Length

2026-01-05T12:40:53.503892image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
staff9672
29.7%
laborers5142
15.8%
core2866
 
8.8%
sales2773
 
8.5%
managers2422
 
7.4%
drivers1722
 
5.3%
high1133
 
3.5%
skill1133
 
3.5%
tech1133
 
3.5%
accountants998
 
3.1%
Other values (13)3567
 
11.0%

Most occurring characters

ValueCountFrequency (%)
a24637
11.6%
s24596
11.6%
r20552
9.7%
e20460
9.6%
f19344
 
9.1%
t13921
 
6.6%
12423
 
5.9%
o10186
 
4.8%
i8271
 
3.9%
n6932
 
3.3%
Other values (26)50810
24.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)212132
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a24637
11.6%
s24596
11.6%
r20552
9.7%
e20460
9.6%
f19344
 
9.1%
t13921
 
6.6%
12423
 
5.9%
o10186
 
4.8%
i8271
 
3.9%
n6932
 
3.3%
Other values (26)50810
24.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)212132
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a24637
11.6%
s24596
11.6%
r20552
9.7%
e20460
9.6%
f19344
 
9.1%
t13921
 
6.6%
12423
 
5.9%
o10186
 
4.8%
i8271
 
3.9%
n6932
 
3.3%
Other values (26)50810
24.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)212132
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a24637
11.6%
s24596
11.6%
r20552
9.7%
e20460
9.6%
f19344
 
9.1%
t13921
 
6.6%
12423
 
5.9%
o10186
 
4.8%
i8271
 
3.9%
n6932
 
3.3%
Other values (26)50810
24.0%

Family member count
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1975313
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size228.0 KiB
2026-01-05T12:40:54.061916image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q33
95-th percentile4
Maximum20
Range19
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.91218872
Coefficient of variation (CV)0.41509704
Kurtosis8.6454749
Mean2.1975313
Median Absolute Deviation (MAD)0
Skewness1.3103351
Sum64091
Variance0.83208827
MonotonicityNot monotonic
2026-01-05T12:40:54.181395image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
215552
53.3%
15613
 
19.2%
35121
 
17.6%
42503
 
8.6%
5309
 
1.1%
648
 
0.2%
714
 
< 0.1%
92
 
< 0.1%
152
 
< 0.1%
201
 
< 0.1%
ValueCountFrequency (%)
15613
 
19.2%
215552
53.3%
35121
 
17.6%
42503
 
8.6%
5309
 
1.1%
648
 
0.2%
714
 
< 0.1%
92
 
< 0.1%
152
 
< 0.1%
201
 
< 0.1%
ValueCountFrequency (%)
201
 
< 0.1%
152
 
< 0.1%
92
 
< 0.1%
714
 
< 0.1%
648
 
0.2%
5309
 
1.1%
42503
 
8.6%
35121
 
17.6%
215552
53.3%
15613
 
19.2%

Account age_x
Real number (ℝ)

High correlation 

Distinct61
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-26.137734
Minimum-60
Maximum0
Zeros247
Zeros (%)0.8%
Negative28918
Negative (%)99.2%
Memory size228.0 KiB
2026-01-05T12:40:54.336724image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum-60
5-th percentile-55
Q1-39
median-24
Q3-12
95-th percentile-3
Maximum0
Range60
Interquartile range (IQR)27

Descriptive statistics

Standard deviation16.486702
Coefficient of variation (CV)-0.63076248
Kurtosis-1.0342853
Mean-26.137734
Median Absolute Deviation (MAD)14
Skewness-0.28850885
Sum-762307
Variance271.81133
MonotonicityNot monotonic
2026-01-05T12:40:54.464156image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-7690
 
2.4%
-6669
 
2.3%
-17659
 
2.3%
-5656
 
2.2%
-8655
 
2.2%
-10645
 
2.2%
-11642
 
2.2%
-16642
 
2.2%
-9629
 
2.2%
-12628
 
2.2%
Other values (51)22650
77.7%
ValueCountFrequency (%)
-60249
0.9%
-59250
0.9%
-58270
0.9%
-57244
0.8%
-56278
1.0%
-55285
1.0%
-54281
1.0%
-53304
1.0%
-52367
1.3%
-51385
1.3%
ValueCountFrequency (%)
0247
 
0.8%
-1444
1.5%
-2519
1.8%
-3626
2.1%
-4625
2.1%
-5656
2.2%
-6669
2.3%
-7690
2.4%
-8655
2.2%
-9629
2.2%

Is high risk
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
28666 
1
 
499

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters29165
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
028666
98.3%
1499
 
1.7%

Length

2026-01-05T12:40:54.578022image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2026-01-05T12:40:54.658755image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
028666
98.3%
1499
 
1.7%

Most occurring characters

ValueCountFrequency (%)
028666
98.3%
1499
 
1.7%

Most occurring categories

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
028666
98.3%
1499
 
1.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
028666
98.3%
1499
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown)29165
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
028666
98.3%
1499
 
1.7%

Account age_y
Real number (ℝ)

High correlation 

Distinct61
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-26.137734
Minimum-60
Maximum0
Zeros247
Zeros (%)0.8%
Negative28918
Negative (%)99.2%
Memory size228.0 KiB
2026-01-05T12:40:54.756262image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum-60
5-th percentile-55
Q1-39
median-24
Q3-12
95-th percentile-3
Maximum0
Range60
Interquartile range (IQR)27

Descriptive statistics

Standard deviation16.486702
Coefficient of variation (CV)-0.63076248
Kurtosis-1.0342853
Mean-26.137734
Median Absolute Deviation (MAD)14
Skewness-0.28850885
Sum-762307
Variance271.81133
MonotonicityNot monotonic
2026-01-05T12:40:54.902758image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-7690
 
2.4%
-6669
 
2.3%
-17659
 
2.3%
-5656
 
2.2%
-8655
 
2.2%
-10645
 
2.2%
-11642
 
2.2%
-16642
 
2.2%
-9629
 
2.2%
-12628
 
2.2%
Other values (51)22650
77.7%
ValueCountFrequency (%)
-60249
0.9%
-59250
0.9%
-58270
0.9%
-57244
0.8%
-56278
1.0%
-55285
1.0%
-54281
1.0%
-53304
1.0%
-52367
1.3%
-51385
1.3%
ValueCountFrequency (%)
0247
 
0.8%
-1444
1.5%
-2519
1.8%
-3626
2.1%
-4625
2.1%
-5656
2.2%
-6669
2.3%
-7690
2.4%
-8655
2.2%
-9629
2.2%

Interactions

2026-01-05T12:40:48.454946image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:40.264599image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.317319image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.408737image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.472420image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.525586image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.447523image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.450487image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.573020image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:40.384991image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.429772image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.546091image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.593533image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.674320image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.562139image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.583645image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.689420image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:40.549099image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.591637image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.694935image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.730030image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.834285image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.680568image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.709682image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.799304image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:40.680367image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.740965image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.813410image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.879353image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.951233image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.795060image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.849428image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.907839image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:40.790046image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.895292image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.957360image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.995530image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:45.986868image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.914251image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.956203image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:49.013320image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:40.902632image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.024151image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.097047image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.123800image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.102103image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.060947image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.069757image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:49.123679image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.051100image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.140741image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.242591image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.275816image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.220809image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.183892image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.209507image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:49.276812image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:41.193126image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:42.294173image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:43.354074image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:44.404095image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:46.336546image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:47.314897image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2026-01-05T12:40:48.342967image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2026-01-05T12:40:55.041079image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Account age_xAccount age_yAgeChildren countDwellingEducation levelEmployment lengthEmployment statusFamily member countGenderHas a carHas a phoneHas a propertyHas a work phoneHas an emailIDIncomeIs high riskJob titleMarital status
Account age_x1.0001.0000.057-0.0050.0130.0120.0770.015-0.0270.0130.0440.0260.0140.0240.018-0.002-0.0260.0640.0250.030
Account age_y1.0001.0000.057-0.0050.0130.0120.0770.015-0.0270.0130.0440.0260.0140.0240.018-0.002-0.0260.0640.0250.030
Age0.0570.0571.0000.3790.1110.123-0.2090.3790.3040.2080.1630.0660.1360.2030.1080.0530.0950.0180.0960.167
Children count-0.005-0.0050.3791.0000.0300.017-0.1420.0710.8250.0630.0860.0200.0070.0560.0040.0270.0430.0000.0580.078
Dwelling0.0130.0130.1110.0301.0000.0510.1140.0620.0670.0830.0410.0400.2040.0390.0270.0320.0520.0080.0720.055
Education level0.0120.0120.1230.0170.0511.0000.1470.0960.0280.0140.1060.0570.0420.0460.0950.0420.1090.0050.2040.047
Employment length0.0770.077-0.209-0.1420.1140.1471.0000.998-0.1450.1750.1540.0100.0960.2420.086-0.008-0.1620.0001.0000.211
Employment status0.0150.0150.3790.0710.0620.0960.9981.0000.1200.1900.1590.0120.0980.2540.1090.0470.0990.0120.1780.108
Family member count-0.027-0.0270.3040.8250.0670.028-0.1450.1201.0000.1050.1150.0250.0180.0550.0270.0260.0240.0060.0590.155
Gender0.0130.0130.2080.0630.0830.0140.1750.1900.1051.0000.3600.0290.0470.0610.0000.0500.2010.0150.5580.164
Has a car0.0440.0440.1630.0860.0410.1060.1540.1590.1150.3601.0000.0100.0110.0170.0160.0580.2060.0000.2720.152
Has a phone0.0260.0260.0660.0200.0400.0570.0100.0120.0250.0290.0101.0000.0650.3120.0100.0650.0460.0000.0670.042
Has a property0.0140.0140.1360.0070.2040.0420.0960.0980.0180.0470.0110.0651.0000.2100.0520.1840.0410.0250.0480.033
Has a work phone0.0240.0240.2030.0560.0390.0460.2420.2540.0550.0610.0170.3120.2101.0000.0350.1270.0350.0000.0620.068
Has an email0.0180.0180.1080.0040.0270.0950.0860.1090.0270.0000.0160.0100.0520.0351.0000.1650.0910.0000.0890.029
ID-0.002-0.0020.0530.0270.0320.042-0.0080.0470.0260.0500.0580.0650.1840.1270.1651.000-0.0220.0160.0640.042
Income-0.026-0.0260.0950.0430.0520.109-0.1620.0990.0240.2010.2060.0460.0410.0350.091-0.0221.0000.0000.1120.032
Is high risk0.0640.0640.0180.0000.0080.0050.0000.0120.0060.0150.0000.0000.0250.0000.0000.0160.0001.0000.0280.022
Job title0.0250.0250.0960.0580.0720.2041.0000.1780.0590.5580.2720.0670.0480.0620.0890.0640.1120.0281.0000.108
Marital status0.0300.0300.1670.0780.0550.0470.2110.1080.1550.1640.1520.0420.0330.0680.0290.0420.0320.0220.1081.000

Missing values

2026-01-05T12:40:49.501787image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2026-01-05T12:40:49.752317image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

IDGenderHas a carHas a propertyChildren countIncomeEmployment statusEducation levelMarital statusDwellingAgeEmployment lengthHas a mobile phoneHas a work phoneHas a phoneHas an emailJob titleFamily member countAccount age_xIs high riskAccount age_y
05037048MYY0135000.0WorkingSecondary / secondary specialMarriedWith parents-16271-31111000Core staff2.0-17.00-17
15044630FYN1135000.0Commercial associateHigher educationSingle / not marriedHouse / apartment-10130-16511000Accountants2.0-1.00-1
25079079FNY2180000.0Commercial associateSecondary / secondary specialMarriedHouse / apartment-12821-56571000Laborers4.0-38.00-38
35112872FYY0360000.0Commercial associateHigher educationSingle / not marriedHouse / apartment-20929-20461001Managers1.0-11.00-11
45105858FNN0270000.0WorkingSecondary / secondary specialSeparatedHouse / apartment-16207-5151010NaN1.0-41.00-41
55100411FYY0135000.0WorkingSecondary / secondary specialMarriedHouse / apartment-13251-38391100Accountants2.0-1.00-1
65022817MYY0202500.0WorkingSecondary / secondary specialMarriedHouse / apartment-17262-16171000Core staff2.0-16.00-16
75009811FNN1202500.0WorkingSecondary / secondary specialMarriedHouse / apartment-11813-32661110Sales staff3.0-21.00-21
85113922FNN090000.0PensionerSecondary / secondary specialSingle / not marriedMunicipal apartment-234783652431000NaN1.0-50.00-50
95021541FYN1306000.0WorkingHigher educationMarriedHouse / apartment-9310-16781000NaN3.0-13.00-13
IDGenderHas a carHas a propertyChildren countIncomeEmployment statusEducation levelMarital statusDwellingAgeEmployment lengthHas a mobile phoneHas a work phoneHas a phoneHas an emailJob titleFamily member countAccount age_xIs high riskAccount age_y
291555021871FYY1315000.0State servantHigher educationWidowHouse / apartment-18233-4251011NaN2.0-30.00-30
291565009779MNN0135000.0WorkingSecondary / secondary specialSeparatedHouse / apartment-14118-31741000Laborers1.0-4.00-4
291575010913FYY081000.0PensionerHigher educationMarriedHouse / apartment-203993652431000NaN2.0-43.00-43
291585065502FYN1135000.0WorkingHigher educationMarriedMunicipal apartment-12523-24821000Managers3.0-13.00-13
291595091339FNY0135000.0Commercial associateSecondary / secondary specialMarriedHouse / apartment-11088-14471010Cooking staff2.0-3.00-3
291605067139FNY0112500.0PensionerSecondary / secondary specialSingle / not marriedHouse / apartment-234003652431011NaN1.0-5.00-5
291615029193FNY1135000.0Commercial associateSecondary / secondary specialMarriedHouse / apartment-15532-82561000Core staff3.0-24.00-24
291625047710FNY076500.0WorkingSecondary / secondary specialMarriedHouse / apartment-17782-32911110Managers2.0-29.00-29
291635009886FNY0157500.0PensionerSecondary / secondary specialCivil marriageHouse / apartment-216353652431010NaN2.0-37.00-37
291645062632FNY0585000.0Commercial associateSecondary / secondary specialMarriedHouse / apartment-18858-20101010NaN2.0-43.00-43